ISO-learning approximates a solution to the inverse-controller problem in an unsupervised behavioural paradigm

Authors

  • Bernd Porr
  • Christian von Ferber
  • Florentin Wörgötter
Abstract

In the previous article we introduced an isotropic algorithm for temporal sequence learning (ISO-learning). Here we embed this algorithm into a formal nonevaluating (“teacher-free”) environment that establishes sensor-motor feedback. The system is initially guided by a fixed reflex reaction, which has the objective disadvantage that it can react only after a disturbance has occurred. ISO-learning eliminates this disadvantage by replacing the reflex-loop reactions with earlier anticipatory actions. In this article we demonstrate analytically that this process can be understood in terms of control theory, showing that the system learns the inverse controller of its own reflex. The system is thereby able to learn a simple form of feed-forward motor control.
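The learning rule itself is only summarised on this page (in the entry for "Isotropic Sequence Order Learning"): inputs are bandpass filtered, summed by a linear output neuron, and each weight changes with the correlation between its filtered input and the derivative of the output. The following is a minimal open-loop sketch of that rule, not the closed-loop behavioural setup analysed in the article. The names `resonator`, `pulse` and `learn`, the filter parameters, the learning rate, the pulse timing and the choice to keep the reflex weight fixed at 1 are all illustrative assumptions.

```python
import math

def resonator(x, f=0.01, q=0.6):
    """Damped second-order resonator, used here as a stand-in bandpass filter.
    f (cycles per step) and q are assumed values, not taken from the article."""
    r = math.exp(-math.pi * f / q)
    c1 = 2.0 * r * math.cos(2.0 * math.pi * f)
    c2 = -r * r
    y, y1, y2 = [], 0.0, 0.0
    for xn in x:
        yn = xn + c1 * y1 + c2 * y2
        y.append(yn)
        y1, y2 = yn, y1
    return y

def pulse(n, at):
    """Unit impulse of length n at index `at`."""
    x = [0.0] * n
    x[at] = 1.0
    return x

def learn(delay, trials=50, n=1000, mu=1e-4):
    """Return the learned weight of the predictive input x1.
    delay > 0 means x1 fires `delay` steps before the reflex input x0."""
    u1 = resonator(pulse(n, 200))          # filtered predictive input
    u0 = resonator(pulse(n, 200 + delay))  # filtered reflex input
    rho0, rho1 = 1.0, 0.0                  # reflex weight kept fixed (assumption)
    for _ in range(trials):
        v_prev = 0.0
        for k in range(n):
            v = rho0 * u0[k] + rho1 * u1[k]   # linear output neuron
            rho1 += mu * u1[k] * (v - v_prev) # input times output derivative
            v_prev = v
    return rho1

if __name__ == "__main__":
    print("x1 before x0:", learn(20))
    print("x1 after x0: ", learn(-20))
```

Under these assumptions the weight of the earlier input should grow when it precedes the reflex input and shrink when the order is reversed, which is the temporal asymmetry that lets an anticipatory signal take over from the reflex.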


Similar resources

ISO Learning Approximates a Solution to the Inverse-Controller Problem in an Unsupervised Behavioral Paradigm

In "Isotropic Sequence Order Learning" (pp. 831-864 in this issue), we introduced a novel algorithm for temporal sequence learning (ISO learning). Here, we embed this algorithm into a formal nonevaluating (teacher free) environment, which establishes a sensor-motor feedback. The system is initially guided by a fixed reflex reaction, which has the objective disadvantage that it can react only af...


Isotropic Sequence Order Learning

In this article, we present an isotropic unsupervised algorithm for temporal sequence learning. No special reward signal is used such that all inputs are completely isotropic. All input signals are bandpass filtered before converging onto a linear output neuron. All synaptic weights change according to the correlation of bandpass-filtered inputs with the derivative of the output. We investigate...


An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage employs a PID controller whose parameters are designed using sine cosine optimization (SCO...


The use of inverse quadratic radial basis functions for the solution of an inverse heat problem

In this paper, a numerical procedure for an inverse problem of simultaneously determining an unknown coefficient in a semilinear parabolic equation subject to the specification of the solution at an internal point along with the usual initial boundary conditions is considered. The method consists of expanding the required approximate solution as the elements of the inverse quadrati...


Designing a quantum genetic controller for tracking the path of quantum systems

Based on learning control methods and computational intelligence, control of quantum systems is an attractive field of study in control engineering. What is important is to establish a control approach which ensures that the control process converges to a given control objective while remaining simple and clear. In this paper, a learning control method based on genetic quantum contr...



Publication date: 2003